RESOLVENTS OF ^-DIAGONAL OPERATORS 



UFFE HAAGERUP*^', TODD KEMP^^^ AND ROLAND SPEICHER'^) 



Abstract. We consider the resolvent (A — a)^^ of any .^^-diagonal operator a in a Ili-factor. Our 
main theorem (Theorem 1.1) gives a universal asymptotic formula for the norm of such a resolvent. 
En route to its proof, we calculate the ^-transform of the operator |A — cj^ where c is Voiculescu's 
circular operator, and give an asymptotic formula for the negative moments of |A — af for any 
diagonal a. We use a mixture of complex analytic and combinatorial techniques, each giving finer 
information where the other can give only coarse detail. In particular, we introduce partition structure 
diagrams in Section 4, a new combinatorial structure arising in free probability. 



1. Introduction 

1.1. Motivation and Main Results. In this paper, we develop a number of universal norm esti- 
mates related to free probability theory. We are, in particular, concerned with ^-diagonal operators, 
which are precisely defined on page 3. Originally introduced by Nica and Speicher in [13], they 
have been considered by many authors in papers including [3, 4, 5, 6, 8, 10, 11, 12, 14, 17]. The class 
of ^-diagonal operators includes both Voiculescu's circular operator and Haar unitary operators, 
and is very large (the distribution of the real part of an ^-diagonal operator can be prescribed 
arbitrarily). They are important in recent work on the invariant subspace conjecture relative to a 
Ill-factor (cf. [5, 6], and have been shown to maximize free entropy given distribution constraints. 

This paper is, in a sense, a continuation of [10] and [8], which examined an important norm in- 
equality (the Haagerup inequality, [2]), originally in the context of Haar unitary operators, general- 
ized to all ^-diagonal elements. The Haagerup inequality compares operator norm to L^-norm for 
homogeneous (non-commutative) polynomials in operators. In [8], the second author considered 
an alternate formulation based on the dilation a ^ ra for r G (0, 1), acting on the C*-algebra gen- 
erated by a family of free generators a. Of interest are the elements 1 -|-ra + (ra)^ + •• • = (1 — ra)~^. 
In that context, he proved a non-sharp version of the lower-bound in Theorem 1.1 below, for a sub- 
class of ^-diagonal operators (those with non-negative free cumulants). The current paper can be 
viewed as providing the sharp norm inequality, for all ^-diagonal operators. 

Our main theorem. Theorem 1.1, gives the precise rate of norm blow-up of the resolvent of 
an ^-diagonal operator near its spectral radius. It is worthy of note for two reasons. First, it is 
notoriously difficult to calculate resolvent blow-up rates, while we have calculated the rate for all 
^-diagonal resolvents. Second, the result is universal: the rate is always polynomial with exponent 
—3/2, and the constant is a product of a uniform factor with a quantity determined only by the 
4th moment of the operator a. 

Theorem 1.1. Let a be an ^-diagonal operator in a lli factor with trace ip, normalized so that (p{aa*) = 
\\a\\2 = 1. Set v{a) = \\a\\f — 1. Then v{a) > iff a is not a Haar unitary, and in this case, for A > 1, 




\\{\-a)-^\\^ \—^/v{a)- -— as A j 1. (1.1) 



In proving Theorem 1.1, we develop several auxiliary results of independent interest. The spe- 
cial case that a is Voiculescu's circular element c affords an example where non-asymptotic calcu- 
lations may be done completely explicitly. In that case, we prove the following result on page 5, 
which (as we show) can be used to prove this special case of the main theorem. 



(2) This work was partially supported by NSF Grant DMS-0701162. 
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Theorem 2.2. Let cbe a circular operator of unit variance, and let A e M. Then 

Theorem 2.2 is proved following its statement via combinatorial techniques; we reprove it in 
Section 3.1 using the analytic techniques developed there. We go on to use that analysis to calcu- 
late, to leading order, the negative moments of the operator |A — for any ^-diagonal a. The 
result, which appears on page 18, follows. 

Theorem 3.11. Let k be a non-negative integer, and let a satisfy the conditions of Theorem 1.1. Then as 
Ail, 

ip{\X-a\ >)^C^ (A2_i)3fc+i ' 
where C^^ = 2i+i ^k) (type 2) Fuss-Catalan numbers. 

We give two different proofs of this theorem: one complex analytic, in Section 3, and the other 
combinatorial, in Section 4.1. Theorem 3.11 by itself yields the sharp lower bound of Theorem 

1.1, as detailed in Section 4.2; the analytic techniques of Section 3 extend to prove this bound is 
also sharp from above. In addition, our combinatorial approach demonstrates that the negative 
moments are in fact polynomials in appropriate quantities. The theorem, appearing on page 27, is 
as follows. 

Theorem 4.8. Let k > 0, and let a satisfy the conditions of Theorem 1.1. Then there is a polynomial P^^i 
in two variables so that 

V.(|A-a|-('=+^)) = P,V (a2^'^ 
for A > 1. 

The proof of Theorem 4.8 led to the development of a new class of combinatorial objects we call 
partition structure diagrams, introduced in Section 4.1. 

1.2. Background. Following is a brief description of those results and techniques from both the 
complex analytic and combinatorial sides of free probability theory that we use in this paper. They 
are here largely as a means to fix notation. The reader is directed to the papers [4, 5, 8, 10] and the 
book [15] for further reading. 

The arena for all of what follows is a IIi -factor £/ with trace tp. Operators in £/ are non- 
commutative random variables. If x G i?/ is self-adjoint, it has a spectral resolution whose projec- 
tions are in £/ ; the measure fix = y^oE^ is the distribution or spectral measure of x. Equivalently, fix is 
the unique probability measure on M whose moments / fixidt) are given by the moments ip{x^) 
for n G N. Even if x G ^ is not self-adjoint, we therefore refer to the collection of all (/^-moments 
of monomials in x and x* as the distribution of x. 

Given a probability measure /x on M, its Cauchy transform is the analytic function defined in 
the upper half -plane C+ by 



Gm(^)= / iMrfi), (1.2) 

The ^-transform of the measure, is the analytic function defined in a neighbourhood of 
determined by the functional equation 

G^i^^iz) + l/z) = z, zeC+, |z| small. (1.3) 

For a known ^-transform ^^j. Equation 1.3 in fact determines G^ on a sector in C+, and thence 
on all of C+ by analytic continuation, modulo the asymptotic restriction that lim|^|_^oo zG^{z) = 1. 
This relationship shows that the measure p, can be recovered from its .^-transform, via the Stieltjes 
inversion formula: 
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^{dt) = lim QGJt + ie) dt. (1.4) 

Equation 1.4 should be interpreted in weak form (that/ f{t)fi{dt) = — Mim^jo / f{t)'^Gi^i{t + ie)dt 
for / G Cc(M)) in general, but in the case that /x has a density p with respect to Lebesgue measure 
fi{dt) = p{t) dt, Equation 1.4 yields p{t) as the limit on the right-hand-side. 

Given x,y e £/ self-adjoint, they are called free if ^^^+y = ^^i^ + ^ny- Freeness can be written 
in other more combinatorial forms by considering the additivity of .^-transforms as a collection 
of statements about Taylor coefficients. It is easy to verify that ^fj(O) = for any measure p, and 
so in general we have 

3^^{z) = Ki{p) + K2{p) Z + K3(/i) 2;^ H 

for some scalars Kn{p) called the free cumulants of p. Thinking of 1% and k„ indexed by a ran- 
dom variable rather than its distribution, we can polarize and express k„ as an n-linear functional: 
. . . , Xn], where . . . , x] = Kn{px)- In this language, freeness can be stated thus; random 
variables are free if all their mixed free cumulants vanish. This parallels the classical connec- 
tion between independence of random variables and their classical cumulants (also known as 
semi-invariants). It also provides an extension of the notion of freeness to any collection of (not 
necessarily self-adjoint) random variables. 

The relationship between moments and free cumulants is given by the moment cumulant formula: 

ip{xi---Xn)= ^ . . . (1.5) 

■K&NC{n) 

Here NC{n) denotes the lattice of non-crossing partitions of the ordered set {1, . . . , n}. Given a par- 
tition vr = {Bi, . . . , Br} (with Bj C {1, . . . , n}), the quantity Ktj[xi,. . . , x„] is equal to the product 
of the r terms k(Sj)[2;i, . . . , x^], where if 5 = {ii,...,im} thenK(5)[xi, . . . = Km[a^n, • • • ,a:^^„]■ 
For example, if vr = {{1,4, 5},{2,3}},thenK^[xi,...,X5] = «;3[xi, X4, X5] k-2,[x2,x^]. 

An non-commutative random variable o € is called .^-diagonal if, among all mixed free 
cumulants in a, a*, the only non-zero ones are among 

K2n[a, a*, . . . , a, a*], K2„[a*, a, . . . , a*, a], 

for some positive integer n. Prominent examples of .^-diagonal operators are Haar unitary op- 
erators and Voiculescu's circular operator c, often represented in the form c = -^{s + is') where 
s,s' are free semicircular random variables — self -adjoint operators with distribution ps{dt) = 
Ps'{dt) = ^\/4 — t'^l{\t\<2})- The class of ^-diagonal operators is closed under free sum and 
product and taking powers, and for any compactly-supported probability measure p on M_|_ there 
is an ^-diagonal operator a with paa* = P- 

The general moment-cumulant formula takes a special form in the case of ^-diagonal opera- 
tors. Let a he ^-diagonal, and consider any monomial in a, a*: a*"^a™i • • • a*"'=a'^'= for ni, . . . , ruk 
non-negative integers. The following formula is a consequence of the definition of ^-diagonality, 
and is proved in [8]. 

(/?(a*"ia'"i •••a*'^'=a"^'=) = a*'™S . . . , a•"^ a*'™'=]. (1.6) 

TreNC{ni,mi,...,nk,mk) 

Here NC{ni, m2, ■ ■ ■ ,nk, nik) denotes the set of non-crossing partitions vr of the list of length ni + 
mi + • • • + nfc + rrik with the property that each block of vr alternately connects as and a*s. Commas 
have been added in the exponents of the cumulants to emphasize that the arguments are not 
products. The set of pairings with this property is denoted NC2{ni,mi, . . . , nj^, mk). 

In fact, it is sometimes useful to consider cumulants with products as arguments (thus the need 
for the commas above to distinguish). The following theorem (Theorem 11.12 in [15]) is a powerful 
computational tool we will use in Section 2. 
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Figure 1. Two non-crossing partitions in NC{2, 3, 4, 3), represented in linear for- 
mat (top) and on the disc (bottom); the latter representation will be more useful in 
Section 4.1. The first partition is a pairing in NC2{2, 3, 4, 3). 

Theorem 1.2. Let i = {ii, . . . ,in) be an n-tuple of natural numbers, and let ai, . . . , a\i\ be random vari- 
ables in a non-commutative probability space. Consider the products 

Ai = ai - • • ai-^ A2 = flii+i • • • fflji+ia • • • An = fli^H hin-i+l ■ ■ ■ '^iiH hin ■ 

The free cumulants of these product variables are given in terms of the free cumulants of the aj themselves 
by 

Kn[Ai, . . . ,An] = ^ K^[ai, . . . ,a|i|], (1.7) 

7reiVC(|i|) 

TVO„ = l|i 

where 0„ is the partition whose blocks are the intervals {1, . . . , ii}, {ii + 1, . . . , ii + Z2}, . . . , {ii -I- • • • -|- 
in-l + 1, . . . ,ii H h in}- 

Remark 1.3. For notational convenience, we will express the relationship between the tuples A = 
[^1, . . . , An] and a = [ai, . . . ,a\\ in Equation 1.7 by a = A. For example, if i = (2, 1, 2, 1) then 
A = [aia2, as, 0405, a&\ and so A = [ai, 02, as, 04, as, ae]. The use of the notation is in summations 
over i, where it is cumbersome to explicitly enumerate the break-points between products. 

The V in Equation 1.7 denotes the join in the lattice A^C(|i|). The meaning of the condition vr V 0^ = 
l|i| is as follows: the blocks of vr must connect the blocks of 0„. To be precise: given any two points 
p, g in {1, . . . , |i|}, there must be a path p = pi ~cti P2 ' • • ^^^-1 Pr = q where aj G {vr, 0„} for 
j = 1 ... r — 1. Indeed, the sequence aj can be chosen to alternate between vr and 0„. Figure 2 gives 
examples. 

1.3. Organization. This paper is organized as follows. In Section 2, we address our main theo- 
rem through the special case of a circular operator c, the pre-eminent example of an ^-diagonal 
element. In Section 2.1, we calculate the ^-transform of the operator |A — for any scalar A, 
using combinatorial means (primarily judicious application of Theorem 1.2). In Section 2.2, we 
us this ^-transform to explicitly determine the support of the spectral measure of |A — cp; its left 
boundary point represents the norm of the resolvent (A — c)^^. In this case, the measure itself can 
be completely determined. 
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TTl 7r2 TTs 

Figure 2. Three examples of partitions in NC{6). With multi-index i = (2,1,2, 1), 
TTl and 712 do connect all the blocks of O4, and so would be included in the sum in 
Equation 1.7; vra leaves the first block of O4 isolated, and so is not included in this 
sum. 

The exact calculations of Section 2.2 cannot be extended to the general ^-diagonal case, and so 
we proceed to develop analytic arguments to prove the asymptotic statement of Theorem 1.1. In 
Section 3, we demonstrate the power of working with the symmetrizations of spectral measures 
(via Equations 3.2 and 3.3). Section 3.1 shows how Theorem 2.2 can be obtained directly from 
these analytic means. Section 3.2 gives a general analytic continuation argument from the above- 
mentioned equations that yields a useful power-series inversion formula, which is then used in 
Section 3.3 to calculate (to leading order) the negative absolute moments of the resolvent (A — a)~^ 
(theorem 3.11). These techniques are then pushed through to give a complete proof of Theorem 
1.1 in Section 3.4. 

Finally, in Section 4, we examine the combinatorial structures underlying the negative absolute 
moments of the resolvent (A — a)^^. In Section 4.1, we introduce partition structure diagrams, a new 
way to view the basic structure of partitions appearing in Equation 1.6 for ^-diagonal operators, 
and use their enumeration to provide a bijective combinatorial proof of the refinement (Theorem 
4.8) of Theorem 3.11. Then, in Section 4.2, we show how knowledge of the asymptotics of negative 
moments alone can be used to recapture the sharp lower-bound of Theorem 1.1. 

2. The Circular Case 

Let c be a circular operator of unit variance in a Ili-factor 'tf. Since c is ^-diagonal, its spectral 
radius is ||c||2 = 1 by our choice of variance. Hence the resolvent 

R,{\) = (A - c)"i 

is a ^-valued analytic function on the domain |A| > 1. Our goal in this section is to calculate 
||i?c(A)|| to leading order as |A| | 1. 

Remark 2.1. Note that c is rotationally-invariant; it follows that if 6* G M then ||i?c(Ae~*^)|| = 
||i?c(A)||. Hence, we restrict our attention to the case A > 1 in M. 

For A > 1, define the positive operator Tx by 

T = Ta = RciXTRciX) = (A - c*)-\X - c)-\ (2.1) 

Note that ||i?c(A)|p = \\Tx\\. What's more, since T > it follows that ||T|| = inf spec (T"!), and 
= (A — c)(A — c*) = |A — cp is an operator we can handle with combinatorial techniques. In 
particular, we will now calculate the ^-transform of this operator, which will allow us to calculate 
the spectral measure of T^^ through Equations 1.2 and 1.3. 

2.1. The ^-transform of |A - cp. 

Theorem 2.2. Let cbe a circular operator of unit variance, and let A G R. Then 



Remark 2.3. We find the formula in Equation 2.2 interesting in its own right. It mirrors a similar 
formula for the semicircular equivalent provided in [7]; if s is a semicircular operator of variance 
h 

Their techniques are entirely analytic: indeed, one can calculate the Cauchy-transform of (A — s)^ 
from that of A — s, the latter of which is well-known, and then the .^-transform is achieved through 
Equation 1.3. Our approach below is markedly different, using only combinatorial techniques; 
however, analytic techniques will be developed to study the more general case in later sections, 
and we will rederive Equation 2.2 using those techniques in Section 3.1 

Proof. Expand |A - = A^ - A(c + c*) + cc*. Denote ai = -A(c + c*) and Q2 = cc*, so that 
|A — cp = A^ + ai + 02- The constant A^ is free from any operator, and so we have an initial 
simplification 

^\^_^\2{z) = +^„,+„2(z) = +I%a,+a,iz)- (2-3) 

Now, Qi, 02 are certainly not free. We calculate the ^-transform of their sum as a power-series 
whose coefficients are free cumulants: 

^ai+aiiz) = '^Kn[ai +02, + 02] z""'^ . (2.4) 

n>l 

The free cumulant k„ is a multilinear function, and so we can expand 

K„[ai + a2, •••,«! + 02] = ^ K„,[Qi^,...,ai„]. (2.5) 

(ii,...,j„)e{i,2}" 

We will shortly see that the vast majority of the 2" terms in the sum in Equation 2.5 are 0. To ease 
notation, let i denote the multi-index (ii, . . . , and denote the n-tuple , . . . , 0;^,^ as a^. Since 
02 = cc* is a product, for each i we can expand the cumulant in Equation 2.5 using Equation 1.7. 

i^n[ai\ = ^ (2.6) 

7reiVC(|i|) 
7rV0n = l|i| 

As in Remark 1.3, the list Oi is the expanded list of products from ai. For example, if i = (2, 1, 2, 1) 
so that Oi = [02, ai,a2, ai] = [cc*, ai, cc*,ai], then = [c, c*,ai,c, c* , ai]. 

Since ai = — A(c + c* ), any such cumulant A;^ [oi] can be expanded into a sum of cumulants k.,^ of 
a list of cs and c*s. Since the only non-vanishing block *-cumulants of c are K2 [c, c*] = K2 [c* , c] = 1, 
this means that the only vr which can contribute to the sum 2.6 are non-crossing pairings. This turns 
out to be an enormous simplification of the sum 2.5; the result is most of these terms are 0. Let us 
first consider the two endpoints. 

Suppose i = (l,l,...,l). The corresponding term in Equation 2.5 is 

Kn[ai, . . . ,ai] = {-X)"-Kn[c + C*,. . . ,0 + c*]. 

Expanding this in 2"^ terms, we have mixed cumulants in c, a*, only two of which are non- vanishing: 

K2[c,c*] = «;2[c*, c] = 1. Hence, 

K„,[ai, . . . ,q;i] = 2A^1{„=2}- (2-7) 
On the other hand, suppose i = (2, 2, . . . , 2). The corresponding term in Equation 2.5 is 

Kn[a2, . . . , 02] = Kn[cC*,CC*, . . . ,CC*]. 

Employing Theorem 1.2 and the above observation that only pairings contribute, we can expand 
this cumulant as a sum, 

Kn[cC* , . . . , cc*] = ^ K^[C,C*, . . . ,C,C*]. (2.8) 

7reJVC2{2n) 
7rv6?i = l2„ 
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In this case, 6„ = {{1, 2}, {3, 4}, ... , {2n — 1, 2n}}. Let vr be any pairing that connects these blocks, 
and consider the block in ir containing 1. Since ir is non-crossing, the match to 1 must be even 
(or there would be an odd number of points in between that could therefore not be paired in a 
non-crossing manner). Suppose 1 ~,r 2A;. If k < n, then there can be no non-crossing path joining 
2A; -f 1 to 2k, since such a concatenation of pairings would have to cross the pairing {1, 2k}, as 
demonstrated in Figure 3. 

1 2 k 

.• «! I* •. 

L_____IL_____I L_____II______IL_____II______I 



Figure 3. 

Hence, it must be that 1 2n. Now, consider the match to 2: say 2 r^^^ 2£ + I. If ^ > 1, 
then the point 3 cannot be cormected to 2 with a path composed of blocks in vr and 0„: since vr 
is non-crossing, the match to 3 must be either 4 or lie within the blocks {5, 6}, . . . , {2£ — 1, 21} . 
None of these blocks can be connected to any other blocks of 0„ via ir without crossing the pairing 
{2, 2£ + 1}. Hence, it must be that 2 ~^ 3. This is demonstrated in Figure 4. 



1 


i 2i + 1 








1 ; L 1 L 1 ; L 

1 isolated 





Figure 4. 

Iterating this argument shows that, in fact, there is only one pairing vr G NC2{2n) for which 
vr V 0„ = l2n'- the pairing Wn = {{1, 2n}, {2, 3}, {3, 4}, . . . , {2n — 2,2n — 1}} pictured in Figure 5. 



" " " 



Figure 5. The pairing zun is the unique pairing for which Wn V 0„ = l2n where 
0; = {{l,2},{3,4},...,{2n-l,2n}}. 

Hence, we have from Equation 2.8, 

Kn[a2, ... ,02] = K^Jc,C*,. . . ,C,C*] = K2 [c, C*]k2 [c* , c]""^ = 1. (2.9) 

Now, we must consider the remaining 2"~^ terms in Equation 2.5, all incorporating some mix- 
ture of ais and a2S. The following lemmas show that only n of these terms are non-zero. 

Lemma 2.4. Suppose i is a length n string containing both Is and 2s, and let 0„ denote the corresponding 
interval partition. If there exists vr G A^C2(|i|) such that vr V 0„ = l|i|, then i contains precisely two Is. 
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For the proof of Lemma 2.4, note first that |i| = #ls in i + 2 • #2s in i, and so for there to be 
any pairings, the number of Is must be even. Suppose, then, that u,v,w G {!,..., |i|} are distinct 
elements at positions corresponding to Is in i. The condition vr V 0„ = implies that there are 
paths 

V = Ui U2 ~(T2 • • • ~(T^-2 ~cr^_i Ur = U, 

V = 'Wl ~ri W2 ~r2 • • • ~r,_2 Ws-l ~r,_i Wg = W, 

where the sequences aj and tj alternate between vr and On. By assumption, v corresponds to a 
singleton in 0„, and so (assuming that the paths are "minimal" so that no number appears as two 
differently indexed uj or vj) we must have (Ti = n = vr. But vr is a pairing, so there is a unique 
point V2 with = ~ V2, and therefore U2 = W2 = V2- Then V2 ~cr2 ^3 arid V2 ~t2 ^3/ where 
(72 = T2 = On- If V2 is a singleton in On, then this is the end of both paths, meaning u = U2 = W2 = w, 
contradicting our assumption. Otherwise, the block of On containing V2 is a 2-block, in which case 
there is a unique with V2 ~(p v^, and so = = v^. Continuing inductively, we reach a 
contradiction to the fact that u ^ w. Hence, there must be precisely two Is in i. 

Remark 2.5. The above proof is really just the following trivial observation: the singletons in On 
must be ends of a path joining blocks, and a path can have only or 2 ends, thence 0„ can have 
only or 2 Is if it is to have this path-connected property. 

Lemma 2.4 shows that the only contributing i to Equation 2.5 are those of the form 2*^1 1 2*^2 \ ^'^'i 
for some ei, £2, £3 > 0. The next Lemma shows that either €2 = or ei = es = in order for the 
term to contribute. 

Lemma 2.6. Suppose i is a length n string containing hath Is and 2s, and let 0„ denote the corresponding 
interval partition. Ifi contains the substring (1, 2, 1, 2) or (2, 1, 2, 1), then Kn[ai] = 0. 

For the proof of Lemma 2.6, note that the discussion following Equation 2.8 may be applied 
locally, and so in a string of the form i = 2^^ 1 2*^2 i 2^3 where ei, 62, £3 > 0, any pairing vr that 
connects the blocks of 0„ must pair according to the dark lines of Figure 6. 



ei £2 £3 




Figure 6. 

The two singletons cannot pair together, since they would then be isolated by On. There are thence 
four positions where the two singletons may pair. If the two singletons pair to the ei and es 
blocks, then the e2 block is isolated, hence at least one singleton must pair to the €2 block, and 
then the other must pair outside the £2 block (or again that block would be isolated). Without 
loss of generality, suppose that the first singleton pairs to the £2 block (otherwise we could simply 
reflect the figure). It must therefore pair to the adjacent position (or else this position cannot pair 
anywhere without a crossing). The remaining singleton must pair to its adjacent block, 63, for 
otherwise the right-most open slot in the e2-block could not be paired without crossings. These 
pairings are represented in the light lines in Figure 6. This forces the remaining pairing in vr 
(between the right-most points in the ei and £2 blocks) to match a c* with a c*, resulting in a 
cumulant. Therefore, although this pairing does satisfy the connectedness condition vr V On = 
the cumulant Kn[a|i|] = 0. 
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Hence, at least one of 61,62,63 must be 0. If either ei or 63 is while the other two are > 0, 
the above argument (unchanged) gives the same result. So consider the case that 62 = while 
61 , 63 > 0, represented in Figure 7. The local argument above Figure 6 yields the necessity of the 
dark lines here. 



Figure 7. 



Since the two singletons cannot pair together (as that block would be isolated), there is only one 
non-crossing pairing, given by the light lines in Figure 7. Each of the dark lines gives a contribution 
K2[c*, c] (or K2[c, c*] for the outside pairing), yielding 1. The remaining pairings are K2[c*,ai] and 
K2[«i, c]. Hence, the index i = 2^^ 1 1 2^^ with 61, 63 > yields K„[ai] = K2[c*, ai] • K2[«i, c], which 
is non-zero (we calculate it below). 

Finally, we consider the case that only one of 61, 62, 63 is non-zero. (The case that all three vanish 
means that 7i = 2, and we will consider that case separately at the end.) Each of these three cases 
is really just a rotation of the one non-zero contributing case above: that is, a cyclic permutation 
of the string i = 2'^^ 1 1 2*^'* already considered. There are a total of n such permutations, and each 
contributes the same cumulant (this follows from the fact that cyclic permutations induce lattice- 
isomorphisms of A^C(|i|)). Hence, each of these n contributes the term K2[c*,ai] ■ K2[ai,c]. This 
completes the proof of Lemma 2.6. 

Let us now collect all terms contributing to Equation 2.5. For n > 2, Equation 2.7 yields that 
if i contains only Is then there is no contribution to the nth cumulant, and Equation 2.9 gives a 
contribution of 1 in the case that i contains only 2s. Lemmas 2.4 and 2.6 then show that among 
all other i, only n contribute a non-zero cumulant, each equal to the product K2[c*,ai] ■ K2[ai,c], 
which we now calculate: 

K2[c*,ai] = Hi2[c*, -A(c + c*)] = -A {k2[c*,c] + K2[c*,C*]) = -A 

K2[«1,C] = K2[-A(C + C*), C] = -A(k2[c, C] +K2[c*,c]) = -A. 

Hence, the total contribution is 

Kn[cti + 0(2, ...,«! + 02] = 1 + nX'^, n > 2. (2.10) 

For n = 1, Ki is the mean; ki[c] = ki[c*] = 0, and so Ki[ai + 02] = ki[cc*] = 1. The second 
cumulant nearly fits into the above analysis, but can be handled separately more easly: 

K2[ai + a2, ai + 02] = K2[ai,ai] + K2[ai, 02] + K2[a2, ai] + K2[a2, a2]- 

The two middle terms are odd, and since odd cumulants of c, c* are 0, these cumulants are 0. The 
first and last terms are included in Equations 2.7 and 2.9, and yield 2A^ and 1, respectively. Thus, 
from Equation 2.4 we have 

^ai+a2 {z) = l + {l + 2X^)z + + nA2)z"-\ 

n>3 

and so 

^^_,^2{z) = A^ + 1 + (1 + 2X^)z + J^(l + nA2)z"-i = ^(1 + nX^)z''-\ 

n>3 n>l 



yielding the power-series expansion of Equation 2.2 as required. 
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□ 



2.2. The support of the spectral measure of | A — cp. Denote by K\-x_^\2 the function 

K\x-c\^{z) = ^\x-c\^{z) + l/z. 

From Equation 1.3, G p^^{K\x_f.]2{z)) = z for small z G C+, where G^^ is the Cauchy transform of 
the spectral measure //a of |A — cp. The result of Theorem 2.2 yields 

1 1 A2 1 + (A2-1)z 

— I 1 = 

z 1 — z (1 — z)2 z{l — z 

For notational convenience, let 



^IA-cp(-) = 7 + T— + 7T— W = -7T— (2.11) 



m = X^-l K^ = K\x_,\2 Gm. = G^,. (2.12) 

So Gm ° Km{z) = z for small z G C+. Our goal is to determine the support of the measure Note 
that this support set is precisely the set of singular points for the Cauchy transform Gm, which we 
now set out to determine. The first derivative of Km{z) is given by 

The quadratic polynomial in the numerator has two zeroes, 

± _ -3 ± V9 + 87n 
4m 

Since m = A^ — 1 > it is easy to check that 

z- G (-00,0) and z+ G (0,1). 
Moreover, by factoring the polynomial in the numerator of Equation 2.13, one has 

^ 2m{z - z-){z - z+) 

which shows that 

K'^{z)<0 for z G (z~,0)U (0,z+). 

Thus is a strictly decreasing function on each of the intervals {z^ , 0) and (0, z^). Now, set 
s"^ = Km{z^)', then simple (though tedious) calculation yields 

± _ 27 + 36m + 8m2 ± (9 + 8m)3/2 
' " 8(m + l) ■ (^-^^^ 

Since lim^^oi Km{z) = ±oo. Km is a decreasing bijection of (z~ , 0) onto (— oo, s~) and i^m is also 
a decreasing bijection of (0, z"*") onto (s+, oo). Moreover since < s+ by Equation 2.14, Km is a 
bijection of {z~ ,0) U (0, z~^) onto (— oo, s~) U (s+, oo). Let 

Lm-. (-oo,s-)U(s+,oo) ^ (z-,0)U(0,z+) 

denote the (function) inverse of the above restriction of Km- Then Lm{w) = Gm.{w) for large values 
of Moreover, Lm is real analytic, and can therefore be extended to a complex analytic function 
Lm in a complex neighbourhood U of (— oo, s~) U {s'^, oo), which we can assume has only two 
connected components D (— oo, s~) and D (s"*"oo). By uniqueness of analytic continuation 
to open connected sets, it follows that Lm{w) = Gm{w) for all w e U. In particular, Gm has no 
singular points in (— oo, s~) U (s+, oo). Ergo, it follows that 

supp/^A ^ [s^,s^]- (2.15) 

Since K'^{z^) = 0, the graph of Lm has vertical tangents at the endpoints {s~ ,z~) and (s"*", 2;+). 
Therefore are both singular points for Gm., and we conclude that 

G supp/ZA. (2-16) 
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To prove that supp fix 



, s+], we apply the result of Voiculescu [18] which implies that the 



unital C*-algebra generated by a semicircular family {sj)j^j has no non-trivial projections. There- 
fore the spectrum spec(a;) of any selfadjoint element x G C* {{sj ; j € J} U {!}) is connected, and 
therefore is either an interval or single point. The standard circular operator c is equal to 



--^{si+is2) 



where si = -^{c + c* 



,S2 



7tM 



*) is a semicircular family with two elements. Hence, 
supp fix = spec ((A — c)*(A — c)) 



is either an interval or a single point; it now follows from Equations 2.15 and 2.16 that supp fi\ = 
[s~, s"'"]. In particular, s~ is the infimum of support of nx- Substituting — 1 for m in Equation 
2.14, we have the following. 



Proposition 2.7. Let cbe a standard circular operator, and let A > 1. Then 

8X^ + 20A2 - 1 - (8A2 + 1)3/2 



inf spec |A — c| 



8A2 



(2.17) 



Proposition 2.7 yields an exact formula for the norm ||i?c(A)|| = ||(A — c)^^||: it is the reciprocal 
of the square root of the expression in Equation 2.17, as discussed following Equation 2.1. We are 
primarily concerned with the leading order terms in this expression. It is easy to calculate the 
Taylor expansion of the function in Equation 2.17. The result is 



inf spec|A — c| 



32 
27 



(A-i)3 + 0((A-l) 



(2.18) 



Taking the reciprocal square root, and noting that ||c||2 = 1 and ||c||4 = 2 so that v{c) = 1, Equation 
2.18 proves Theorem 1.1 in the special case that the ^-diagonal operator a is a circular c. 

It is possible to compute the Cauchy transform Gm of fix explicitly using Cardano's formula for 
solving cubic equations: for w e C — R, the number z = Gm{w) is a solution to the equation 
Km{z) = w, which can be reduced to the following cubic equation in z: 

1 



2z^ + 



1 



m 



w 



0. 



w 



After determining the correct branch among the three solutions, one can then use the Stieltjes 
inversion formula of Equation 1.4 to show that fix has a density with respect to Lebesgue measure, 
and compute this density explicitly. Figures 8 and 9 below are based on such computations. 






Figure 8. The densities of the measures fix{dt), for A = 2, 3, and 10, from left to right. 
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Figure 9. The densities of the spectral measures {l/\^)*n\{dt) of the operators 
|A-c|-ifor A = 2, 3, and 10, from left to right. Note that |i2c(A)|2 = [{X-c*){X-c)]-^ 
has the same distribution as [(A — c)(A — c*)]~^ = |A — c|~^ since the von Neumann 
algebra generated by c is tracial; hence the density of |A — c|^^ is the same as the 
density of |i2c(A)|. 



3. Resolvents in the General ^-Diagonal Case 

This section is largely devoted to the proof of Theorem 1.1. The direct calculations used in 
Section 2 are not available in this case: for a general ^-diagonal operator a, it is far more difficult to 
find a closed-formula for the ^-transform of | A — op. Nevertheless, in Section 2.2, we determined 
the support of the measure through critical points, following [5]; through a similar approach, we 
will be able to calculate the norm of i?a(A) to leading order as it tends to oo, for any ^-diagonal 
operator a. 

Let a be ^-diagonal in a Ili-factor £/. Expanding £/ if necessary, we may choose a Haar unitary 
u *-free from a. It is easy to check that au has the same *-distribution as a; indeed, this can be used 
as a definition for ^-diagonality (cf. [15]). As such, for A > 0, 

\a - Ap ~ \au - A|2 = {au - X){u*a* - A) = (a - Xu*)uu*{a* - Xu) = \a - Xu*\'^. (3.1) 

Hence, the spectral measure of \a — X\ is the same as that of \a — Xu*\, and (a, Xu*) are *-free 
diagonal elements. We may now employ the following tool for calculating the ^-transform of a 
sum. 

Proposition 3.1. Let a, b be *-free M-diagonal elements, and for any self-adjoint element x let fj.^ denote 
its spectral measure. For any Borel probability measure fi on M, denote by jl the symmetrization of ji: for 
any Borel set SCR, jl{B) = \{ii{B) + n{-B)). Then 

A|a+f)| = /i|a| ffl (3-2) 

Proof. This is proved in Proposition 5.2 in [14]. A different proof is given in Proposition 3.5 in 
[4]. □ 

Applying Equation 3.2 to the preceding discussion, we have 

Of course, |Ati*| = A for A > 0, and so fl\xu*\ = ^{^\ + Therefore, the associated Cauchy 

transform is ^(jta + i^) = z'^~\'^ ■ Solving the equation w = ^rzrj^ for ^ yields z = ^'^^^2w^^ ~' 
and so Equation 1.3 yields that 

^ , , 1 + Vl + 4A2?i;2 1 Vl + 4A2u;2 - 1 
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(the sign of the square root is chosen so that ^ maps C_|_ into itself). Employing, finally, the 
additivity of the ^-transform over free convolution, we have the following. 

Proposition 3.2. Let a be ^-diagonal. Denote by /j, the symmetrization and let ii\ denote the sym- 
metrization il\a~x\- Then 

^^^{z) = M^{z) + ^ , for small z£C+. (3.3) 

Remark 3.3. If a measure ji is supported in [0, oo), then ^ and fl contain the same information. 
Hence, Proposition 3.2 actually allows the determination of the spectral measure of | A — a| . Despite 
this, a direct derivation of from Gjx is not obvious in general (the latter is the odd part of the 
former). However, both have the same square. That is, if □ is the map □(x) = for a; G M, then 
the push-forwards D*^ = D^/i are equal and so too are the Cauchy transforms in the case that /U is 
supported in [0, cx)). 

3.1. The .^-transform of | A — - analytic approach. To demonstrate the power of Equation 3.3, 
we will now use it to give an alternate, entirely analytic proof of Theorem 2.2. 

Analytic proof of Theorem 2.2. Let v\ denote the spectral measure of |A — cp = (A — c)(A — c*), and 
V = uq the spectral measure of |cp = cc*. Following the notation in Proposition 3.2 and Remark 
3.3, with i^Lx = /i|A-c| srid /i = we have v\ = D^/iA and v = D^Z-i where □(x) = for x G M. 
Note that /i|c| is the quarter-circular law n\c\{dt) = |:-\/4 — i^l[o,2](*) dt, and so /i is the standard 
semicircle law ^{dt) = — t^l[_2,2] (*) dt. Thence ^%^{z) = z, and Equation 3.3 reads 

^ , , Vl + 4A2z2 _ 1 

I%^^{z)=z + ^ , (3.4) 

for small z G C+. In this case, the precise domain is easy to determine since is analytic ev- 
erywhere; the domain of analyticity above is\z\ < Adding 1/z, Equation 1.3 shows that the 

functional inverse G^t^^^ is given by 



, l^, , , Vl + 4A2z2 + l 1 

gi:hz)=z+ , 0<N<-. 

Put s = v^l±^iE±l; then small \z\ corresponds to large The quantity s is best characterized as 
a solution to the quadratic equation 2:52 — s — zX"^ = 0, and so in terms of s we have 

s 



S2 _ ^2 ' 

which evidently makes sense for large |s|. Thus, Equation 3.4 may be written in the form 

( ^— n + ) ' for |s| large. (3.5) 



Now, let us consider G^^ = Gn^^^. We have 



— I ( — \ — 1 l^xidt). 

2w \w - t w + t ' 



(3.6) 



A change of variables and the fact that is symmetric shows that both integrands above have 
the same integral, and so Equation 3.6 becomes 
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Substitute w = ^-zlx^ + ^ ^^^^ Equation 3.5, and Equation 3.7 yields that for large \s\, 



2\ / \ -1 



Inverting G^,^ in Equation 3.8 and again using Equation 1.3, we get for large |s|. 



^''n 1 + s2 - A2 y vi + s2_A2 

2 

S 



— a? 



{l + s'- X') (3.9) 



Finally, set z = (1 + — A^)^^, so that large |s| corresponds to small \z\. Then 

•S^ — A^ 9 1 , 9 , 

1-^ = ^ ^2—7' s2 = - + a2-1. 

— A^ + 1 

Substituting into Equation 3.9, we find that for small / 0, 

- ,(-) = (--^ + A^-l) ' ' 



(1-Z)2 Z 



z-^ - 1 A2 1 



+ n Z^2 - - (3-10) 



(1-Z)2 (1-Z)2 Z 

1 1 A2 1 A2 
- + -71 ^ = Z + 



Z{l-Z) Z ' {I-ZY l-Z ' (1-Z)2' 

which is the desired result. □ 

Remark 3.4. The above proof is rather shorter than the one in Section 2.1. It relies on the somewhat 
sophisticated analytic result of Proposition 3.2; on the other hand, the proof in Section 2.1 relies 
on the sophisticated combinatorial result of Theorem 1.2. The benefit of the combinatorial proof is 
that it provides a direct explanation for all of the terms in the the ^-transform of u\, which is the 
reason we've included it. 

3.2. Analytic Continuation and Roots of G^^ . Our goal is to use Equation 3.2 to determine (to 
leading order) the smallest positive singular value of , which is the reciprocal of the spectral 
radius of the resolvent Ra{X) = (a — A)~^ of our ^-diagonal operator a in the Ili-factor . Let <f 
denote the trace on . Adding 1/z to both sides, rewrite Equation 3.2 in the form 

G^, M^{z) + = for small z ^ 0. (3.11) 



(3.12) 



Following Section 4 of [5], we introduce the auxiliary functions 

h{s) = s if [{aa* + s^y^) , s>0 
hx{s) = s (((a - A)(a - A)* + s^)"!) , A, s > 0. 
Then, as proved in Lemma 4.2 in [5], 

G^,{is) = -ih{s), s > (3.13) 

G^,iis) = -ihxis), X,s>0. (3.14) 

Using Equation 3.13 together with the definition of in terms of G^, it follows that there is some 
large sq > so that 

^^{-ih{s)) - -rj^ = is, s > So- (3.15) 
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Combining Equations 3.11 (with z = —ih{s)), 3.14, and 3.15, we have 

, / 1 Jl-AX^h{s)A , ,,,,, 

for large s — say s > for some sx> sq- 

Based on Equation 3.16, it was proved in Proposition 4.13 of [5] (see also Lemma 4.8 and Defi- 
nition 4.9) that hx{t) can be obtained from h for alH > in the following way. 

Proposition 3.5. For every t > 0, the equation 

{s - t) (tt^ ^ = (3-17) 

^ '\h{s)-s + tj 

has a unique solution s = s(A, t) in the interval {t, oo), and 

hx{t) = h{s{X,t)), t>0. (3.18) 
Corollary 3.6. Let s > 0, and assume that 1 — 4A^/i(s)^ > and 

^ 1 Vl-4AW ^0 
2h{s) 2h{s) 

Then 

Proof. Put t = s — 2^ — — — 2h(s) ■ Then s > t (since h{s) > from Equation 3.12). It is a simple 

matter to check that s is a solution to Equation 3.17, and therefore s = s{X,t). Hence, Equation 
3.19 follows from Equation 3.18. □ 

Remark 3.7. Comparing Equations 3.16 and 3.19, we see that that point of Corollary 3.6 is that the 
negative root may also be chosen in the determining equation for hx- This results in the similar 
alternate version of Equation 3.11 in Proposition 3.8 below, identifying what turns out to be the 
correct singular value of . 

In the following, we assume that A > ||a||2. Since spec(a) C B{0, ||a||2) (cf. [4]), a — A is invertible 
and therefore supp (fix) ^ M \ {—6x, Sx) for some 6x > 0. Therefore G^^ is defined and analytic in 
a complex neighbourhood of 0. 

Proposition 3.8. For all z in a small complex neighbourhood ofO, 

G„ (*„(--) + i^4±i^)=.. (3.20) 

Proof. Since fi is symmetric, its odd moments are all 0. Also, since = fj.\a\^, the even moments 
are given by 

"i2fc(/^) = Ikllifc, A; = l,2, ... 

Therefore, 
Hence, 



Gf,{z) = - + ^ + Ol^], |z|^oo. 



M.) = -.G,(z.) = i + Mi + o(l) 



oo. 



It follows that 



s{ 1-^ + Oi ill, s^oo 



h{s) " \^ s2 
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and 



Taking t = t{s) as in the proof of Corollary 3.6, we have 



^""^ ~ 2h{s) 2h(s) 

= (A^-IHIi)^ + o(l 

and therefore t{s) > for s > s'^ for some s';)^ > 0. Thus, by Corollary 3.6, 

hx{t{s)) = h{s) for s > s'^ 
which is precisely the statement of Equation 3.20 in the case z = —ih{s). Since 

1 - Vl + 4A2^ 



lim 

^^0 



2z 



0, 



the left-hand-side of Equation 3.20 is well-defined and analytic in a complex neighbourhood of 0. 
Hence, as we have shown that the equation holds for all 2 in an imaginary line segment accumu- 
lating at 0, Equation 3.20 holds everywhere in a complex neighbourhood of 0. □ 

3.3. Negative Moments. From now on, let us normalize a so that ||a||2 = 1; we therefore only 
consider A > 1. We also assume that a is not Haar imitary, which (under this normalization) is 
equivalent to the requirement that ||a||4 > 1. Put 

MA = ||a-A||, mx = \\{a-X)-^\\-\ 

Then 

supp {^x) C [-Ma, -mx] U [?tia, Ma]. 
Moreoever, ±mA and are singular points for the Cauchy transform 

Gma(-2)= / -^^^\{dx), z G C- ([-MA,-mA]U [mA,AfA]). 

J^Z — X 

Our goal is to determine the asymptotic behaviour of mx as A J, 1; we will accomplish this through 
Equation 3.20. 

The free cumulants Kn{fi) vanish for odd n since /i is symmetric, and so is given by the 
power series 

S^niz) = K2{fJ')z + K4(/i)z^ + KG{fJ,)z^ H 

for z in a complex neighbourhood of 0. Moreover, since is centred, 

K2(m) = rn2ifJ.) = IMl = 1 

and 

K4{fi) = m/i{^) — 2 = ||a||4 — 2. 

Define 

v{a) = \\a\\\-{\\a\\lf = K4{^,) + l. (3.21) 
Then v{a) is strictly positive. We have 

^^{z) = z + {v{a) - l)z^ + K6(^)z^ + 0(/). 
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Now, we may expand \/l + AX^z^ as a Taylor series about 0. The result is 

1- Vl + 4A2z2 1 / ^/i/2^ 



oo 

f=i 

= -X^z + A^z^ _ + . . . 

Here Cfc is the Catalan number Ck = i^^) ■ Now, following Equation 3.20, define 



B^{z) = M^{z) + ^-^ . (3.22) 



1 - VI + 4A2z2 

From the preceding discussion, B\ has a the power series expansion 

Bx{z) = {l-\^)z+ {v{a) - 1 + A^) + - 2A^) / + O(z^), (3.23) 

in a complex neighbourhood of 0. 

By Proposition 3.8, Bx{z) and G^j^ {w) are inverse functions of each other when both \ z\ and \ w\ 
are small. Since supp {^\) C M \ {—5\, 6\) with 5^ > 0, we have for < 5x, 

M(-5a,<5a) 2; V ^ / 
= - (m_2(/iA) + m-3{f.tx) up' + m_4(^A) H ) , 

where 

rri-kifJ-x) = / x~^nxidx), fc = l,2, ... (3.25) 

are the negative moments of /Ua. Again, since fix is symmetric, m^kifJ-x) = for /c odd. We 
will now use the Lagrange inversion formula, together with Equations 3.23 and 3.24, to express 
the even negative moment m^2i{f^\) in terms of the free cumulants K2(^a), K4(^a), • • • , i^2i{f^\) for 
£ G N. In particular, as can be easily calculated directly. 

Lemma 3.9. Let v > 0. The inverse of the function 

F{z) = z - vz^ 

has the power series expansion 

oo 
k=0 

■v(2) _ _l(3k\ 



^hereClf> = ^,{'l:). 

bers cj^^ in Le 



Remark 3.10. The numbers cj^^ in Lemma 3.9 are the p = 2 case of the Fuss-Catalan numbers 



^ pk + l\ k 

Note that when p = 1 we recover the standard Catalan numbers. The appearance of these combi- 
natorially interesting numbers in free probability theory is discussed at length in [10] and [12], in 
addition to more recent papers and preprints [1], [8], and [9]. 
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Proof. Since F is odd, so is its inverse. Write the (yet-to-be-determined) coefficients of as 
F^~^^ (w) = w + b^w'^ + b^w^ + • • • . Since F is analytic at and F'{0) / 0, the Lagrange inversion 
formula states that 

Writing = (1 — ti2;^)~(^'""+^) • z"^'^^, we see that the residue in question is the coefficient 

of in the power series expansion of (1 — which is equal to 

^,, (2fc + l)(2fc + 2)---(3fc) ^ ^^l^k 

This proves the lemma. □ 
Theorem 3.11. Let kbea non-negative integer. Then as X I 1, 

^n-2k-2i^^x)-cl^ \/}"l',,^, . (3.27) 

Remark 3.12. The appearance of the Fuss-Catalan numbers to leading order in Equation 3.27 for 
the negative moments of fix begs for a combinatorial explanation. Indeed, there is a completely 
combinatorial proof of Theorem 3.11; this is the content of Section 4.1. What's more, the lower 
bound of Theorem 1.1 can be proved with a simple estimate directly from Theorem 3.11; this is 
the content of Section 4.2. 

Proof. Put V = v{a) > 0. Referring to Equation 3.22, rescale Bx and set 

Fx{z) = -(A2 - l)-^/^Bx ((A2 - 1)V2,) . (3.28) 
Then following Equation 3.23 and using Equation 3.26, we have 

Fxiz) = z-iv-l + A^)^^ - (a2 - 1)(k6(m) - 2X<')z^ - (A^ - 1)^(^8 + SA^)^^ + • • • 
Hence, the coefficients in the power series for Fx{z) converge to the coefficients in the power series 

F{z) = z-vz^ + Oz^ + 0z'^ = z- vz^ 
as in Lemma 3.9. Therefore, by the continuity of the Lagrange inversion formula, the coefficient 
4fc+i z'^^'^^ in F^^^^^ (z) converges by Lemma 3.9 to cj^^v'' as A J, 1; that is. 

Now inverting Equation 3.28 (setting u = (A^ — l)z), 

and therefore for \w\ small, 

G,Auj) = B^-'\w) = {X'-l)'/'Ft'^ 



w 



.(A2 _ 1)1/2 ^(-1> 



(A2 - 1)3/2 

W 

(A2 - 1)3/2 



(3.30) 



the last equality following from the fact that f| is an odd function. 

Now, m^2k~2{fJ'x) is the coefficient of u;2^+i in the power series expansion of G^^ {w), and so by 
Equations 3.30, 

and hence the Theorem follows from Equation 3.29. □ 
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3.4. Proof of Theorem 1.1. The convergence of the coefficients of Fx to those of F as A | 1 is not 
enough to prove our main theorem. In fact, the convergence is stronger, as we now show. Choose 
g G (0, j) small enough that is analytic in B{0, g). Then 



Bx{z) = ^^{z) + - 



2z 

is analytic in -6(0, g) for all A G (1, 2). Thence, with Fx as in Equation 3.28, Fx is analytic and 
well-defined in .8(0, g/VX'^ - 1). Note that g/VX^ - 1 ^ oo as A j 1. 

Proposition 3.13. With Fx as in Equation 3.28 and F as in Lemma 3.9, Fx{z) F{z) uniformly on 
compact subsets ofC as X I 1. 

Proof. We claim there exists a constant C > and a g' > such that, for all z G B{0, g') and all 
AG (1,2), 

\Bx{z) - (1 - A2)z - {v{a) - 1 + X^)z^\ < C\zf. (3.31) 
For the moment, assume Equation 3.31 has been proved. Then setting w = (A^ — l^/^z, 

\Fx{z)-z + {v{a)-l + X'^)z^\ 

=1 _ (a2 _ ly^/^Bx ((A2 - 1)V2^) _ , + (^(«) _ 1 + x')z'\ 

-(A^ - 1)-^/2s.(z«) - j^^^ + (^'(a) - 1 + A^) ((X^^) ' 

=(A2 _ i)-y^\Bxiiv) - (I - X^)w - (via) - 1 + X'')w% 
and by Equation 3.31 this is 

< _ l)-3/2 = (A2 - l)-3/2 C\{X^ _ 1)1/2^|5 ^ ^(;^2 _ -^)|^|5 

for z G 5(0, g'/VX?^). Thus, we have 

Fa(z) -F(z) = Fxiz) - {z-v{a)z^) = Fx{z) -z + v{a)z^ = Fx{z) -z + {v{a) - 1 + A^)z^ + (1 - A^)^^, 
and so 

\Fx{z) - F{z)\ < \Fx{z) - z + iv{a) - 1 + A^)^^! + (A^ - l)\z\' 
< (A^-1)|z|3 + C7(A2-1)|z|5. 

This proves the proposition. Hence, it remains only to verify the estimate of Equation 3.31. Refer- 
ring to Equation 3.23, 

oo 

Bx{z) - (1 - X^)z - {v{a) - 1 + A4)z3 = ^[^M - {-lYc^.X''] z^'~\ 

e=3 

It is convenient to break this up as a sum of two power series, Bx{z) = D{z) + Cx{z) where 

D{z) = Ke{fi)z^ + K8{n)z'^ + ■■■ 

Cx{z) = -CaAS^ + CsA^z^ 

Now, D{z) is a truncation of the power series for =^^(z), which is convergent and analytic in 
-6(0, g); as D{z) has a of order 5 as 0, it follows that there is a constant Ci > \Ke{iJ.)\ such that 
|-D(z)| < ci\z\^ in that neighbourhood of 0. On the other hand, note that Ck < 4*^, and so with 
A < 2, 

|C.(.)| < 2i°|z|^ + 2^^|zr + . . . = 2i°-^L, 

1 — 1d|z|^ 

and so choosing g' < 1/4, we may choose a constant C2 > with |Ca(z)| < C2\z\^ for z G B{0, g'). 
Setting C = ci + C2, this proves Equation 3.31. □ 
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Lemma 3.14. For all X > 1 sufficiently close to 1, there is a unique x\ G (0, ^) (where v = v{a)) such 
that the real analytic function x ^ F'^{x) has the following sign variation in [—^i -^]- 

• F'^{x) > Ofor X £ {-xx,xx)- 

• F'^{x) = 0/or X = ±xx- 

• < Oforxr < \x\ < 

Moreover, xx^ ^ and Fxixx) i^)^^"^ fls A j 1. 

Proof. The uniform convergence of Fa to F on compact subsets of C implies by standard complex 
analysis that F^'^ (the pth derivative of Fx) converges uniformly to F^^^ on compact subsets of C, 
for each p G N. Since F^'^^ {x) = —6v < 0, we can choose Aq > 1 such that for all A G (1, Aq), 

FfVx)<0, for Ixl < (3.32) 

Hence, for A G (1, Aq), F" is strictly decreasing on [—^, :^]- Moreover, F"{0) = since Fa is an 
odd function. Therefore, 

F'^{x)<0 for xG(0,^). 
Hence F^ is strictly decreasing on [0, for A G (1, Aq). Moreover, 

nmFj;(0) = F'(0) = l 

li^F'x{^) = F'{^) = -2. 

Therefore we can choose Ai G (0, Aq] such that 

F{(0) > and F'x{^) < for < A < Ai. 

Hence, for all such A, the equation F'^{x) = has exactly one solution xa in the interval (0, ^), 
and F'^{x) > for x G [0, xx), while F((x) < for x G (xa, ■^). Since F( is an even function, the 
above-stated sign variation holds for all A G (1, Ai). 

Now, note that is a critical point for F, and also for e G (0, ^), 

F'(^) > and F'(^) < 0. 

V3t) Vol) 



Therefore, by the uniform convergence, 

for A > 1 sufficiently close to 1. It follows that < xa < eventually as A J, 1. This shows 



that limAj.1 xx = and by the uniform convergence of Fa to F on [—v, v], it follows that 

limFA(xA)=F(^) = (2|,)^/\ 

as required. □ 
This finally brings us to the proof of the main theorem. 

Proof of Theorem 1.1. It follows from Lemma 3.14 that for all A > 1 sufficiently close to 1, F;[~^^ has 
an analytic extension to a complex open neighbourhood of the interval (— Fa(xa), Fa(xa)), but that 

both endpoints of the interval are singular points of F^ . Since 



G,J.) = -(A2-l)V2Fi-^> (- 



(A2 - 1)3/2^ 

for \z\ small, it follows that G^^ has an analytic extension to a complex neighbourhood of 

h = (-(A^ - 1)'/'Fa(xa), (A2 - 1)3/2Fa(xa)) , 
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but the endpoints of the interval are singular points for . Therefore, 
while 

h n supp(^a) = 0- 

Since fi\ = ll\a~\\, it follows that 

for A > 1 sufficiently close to 1. As A | 1, this tends (by Lemma 3.14) to 

23/2 _ 1)3/2 y 4 J ' 

thus proving Theorem 1.1. □ 

4. The Combinatorics of Negative Moments 

In this final section, we provide a new combinatorial framework for even moments of the ab- 
solute resolvent of an ^-diagonal operator (that is, negative moments of |A — ap where a is ^- 
diagonal and A > ||a||2)- This approach, through partition structure diagrams, is used in Section 4.1 
below to give a new proof of (a more refined statement of) Theorem 3.11. In Section 4.2, we show 
how knowledge of these moments alone yields the sharp lower bound of Theorem 1.1. 

4.1. Partition Structure Diagrams. Let us normalize a once again so that ||a||2 = 1, and let A > 1. 

For convenience, let r = 1/A G (0, 1). Then we may rewrite iia(A) = (A — a)~^ = r(l 
Hence, for any positive integer k, 

|ii„(A)|2('=+i) = [(A - a)(A - a*)]-('=+^) = [(A - a*yHX - a)"^]^^^ 

= r^ik+i) [(I - ra*rHl - rar']'^\ 

Expanding the geometric series inside this term gives 



ra 



1-1 



2{fc+l) ^ ^2(fe+l) 



n>0 m>0 



k+1 



Expanding this product of summations we have 

|_^^^_;!^)|2{fc+l) _ ^2(fc+l) ^noH hn^+moH hruk g*ni) ^mo _ _ ^*nk ^m^ 



no,...,nj. 
mo,...,mj. 



Since a is ^-diagonal, it is rotationally-invariant, and so only monomials with equal numbers of 
a and a* can have non-zero mean in the state Lp. Thus, 



^{^^) = ^ (|/2„(A)|2(^-+i)) = r2(^-+i) Y2 r2("«+-+"fc) (^(a^'^Oa™" • • • a*"^a"^'=). (4.1) 



nQ,...,-nf. 
mo,...,mj, 
no + --- + nj, = moH hmj, 



We now employ Equation 1.6 expanding the mean as 

■K&NC{no,mo,...,nk,mk) 

Notation 4.1. The following shorthand notations will make the text below far more readable. 

• Let n, m stand for multi-indices (no, . . ■ , n^) and {mo, . . . , mfc). 

• The sum of a multi-index no + ■ ■ ■ + Uk is denoted |n|. 

• Denote the interleaved multi-index (no, niQ, . . . , n^, m^) as n /X\ m. 

• Denote the cumulant K2^[a,a* , . . . ,a, a*] as a^{a); note, by traciality, ai{a) is also equal to the 
cumulant K2i[a* ,a, . . . , a*, a]. 
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Because a is .^-diagonal, tlie mixed cumulant K^[a*'"'° , a'™" , ■ ■ ■ , 0*'"*= , 0'"^*=] is a product of terms 
a£(a) with £ between 1 and k + 1. The precise product is determined by the block profile of the 
partition tt. 

Definition 4.2. Let n, m fee multi-indices, and let tt e NC{n/^\m). The block profile pr(7r) is the multi- 
index p = {pi, . . . ,Pk+i), where pi is the number of blocks in vr of size 2, p2 is the number of blocks in vr of 

size 4, and so forth. Notethat, in this case, the sum \n\ + \in\ = 2|n| is equal to 2pi+4p2-\ \-2{k+l)pk-\-i- 

Denote this number as e(p), 

fc+i 

e(p) = 2^J>i. 

By definition, if pr(7r) = p, then the mixed cumulant K7r[a*'"'', a'™'^, . . . , 0*'"*=, a'"^*] is equal to 
ai{ay^ ■ ■ ■ ak+i{ay''+''- . Denote this product as a(a)P. We can thus re-index the sum in Equation 
4.1 as follows: 

m_2fc-2(/iA) = r2^'=+^) J2 Yl alajP'-Wr^^P'-W). (4.2) 

The idea now is to reindex the sum over n, m and vr in terms of a new set of combinatorial 
objects: partition structure diagrams. Set Vk+i = {(1, 1), (1, *), (2, 1), (2, *),..., (A; + 1, 1), {k + 1, *)}; 
view Vfc+i as vertices (in sequence) on the boundary of a disc. It is possible to encode all the 
information in a pair n, m and a partition vr G NC{n /X\ m) succinctly in terms of Vk+i, as follows. 
Consider subsets of V^+i of the form P = {{vi, 1), (wi,*), . . . , {ve, 1), {we, *)} where vi < wi < 
V2 < W2 < ■ ■ ■ < ve < We and i < k + 1; viewed on the disc, P is a convex 2£-gon whose vertices 
alternate between 1 and *. (Note: ^ = 1 is allowed - a 2-gon is a line-segment.) 

For any n, m, vr G NC{n /A m), assign non-negative integers to the polygonal subset P of T4+i 
as follows. P is assigned the number of 2^-blocks in vr that connect vertices in sequence as follows: 
a vertex from the vi run of Is, then a vertex from the wi run of *s, then a vertex from the f 2 run 
of Is, and so forth. As such, a triple n, m, vr yields an labeled polygonal diagram, or LPD. Figure 
demonstrates this procedure. 



(1.1) 




(4,1) 



TT e NC(n A\ m) the resulting LPD 

Figure 10. A partition in TT G NC{n A\ m), where n = (3,4,2,5,1,1) and 
m = (4, 2, 3, 2, 4, 1), together with the resulting labeled polygonal diagram. 2-gons 
are drawn as straight line-segments, while non-degenerate polygons are drawn 
non-convex so all lines and labels are clearly visible. Those polygonal subsets not 
appearing in the LPD have label 0. 



Note, in Figure 10, those polygons with more than 2 sides have label 1 or 0. This is a general 
phenomenon, as the reader can easily check: since vr is non-crossing, there can be at most one 
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block of size 21 {I > 1) connecting a set of vertices in the same 1- and *-runs. However, this 
restriction does not apply to 2-blocks: there can be many nested pairings, as seen in Figure 10. 
There is a simple geometric explanation here: for fixed n, m, the map from vr to the LPD (viewed 
as a collection of inscribed polygons in a disc) is a compression. Two (2i?)-blocks {£ > 1) in vr 
with the same image under the compression would have non-trivially intersecting interiors, and 
therefore would cross. Since 2-blocks have no interior, on the other hand, they can compress to 
the same 2-gon without crossing. 

Not every LPD is the compression of a partition: as explained, to come from a partition, the label 
of any non-degenerate polygon must be or 1. But there are further restrictions. For example, if 
the label of the 2-gon joining (2, 1) to (4, *) in Figure 10 were non-zero, no non-crossing pairing 
could compress to that LPD. The restriction, of course, is that the polygons with non-zero label 
in the LPD should be non-crossing as well. This brings us to the notion of a(n unlabled) partition 
structure diagram. 

Definition 4.3. A partition structure diagram or PSD with 2{k + 1) vertices, D, is a collection of 
polygons, each with an even number of sides, inscribed in the disc Vk+i, with the following additional 
properties: 

• The vertices of any polygon P e D are among the vertices Vk+i, and any edge in P connects a 1 
with a *. 

• The intersection of any two polygons in P has area; that is, if any two intersect, it is along a 
common set of edges or vertices only. 

Denote the set of all such D by PSDk+i- 

Following our discussion, ii D = (Pi, . . . , P^) is a PSD, and we label it with positive integers 
L(Pi), . . . , L{Ps), then it can only be the compression of a partition in some NC{n, m) if the label 
of any non-degenerate polygon in is 1. Call such a labeling L valid. 

In fact, it is easy to see that any such labeled PSD is the compression of a unique partition. Here 
is the algorithm: begin by decompressing each labeled 2-gon into the requisite number of nested 
2-blocks. Then, to avoid crossings, each remaining non-degenerate polygon can be inserted in one 
and only one way - closer to the center of the disc than any surrounding 2-blocks. 

Hence, there is a bijection between the set {(n, m, vr) ; n, m G N''"'"^, |n| = |m|, vr G A^C(n /X\ m)} 
and the set {{D, L); D £ PSDk+i,L is a valid labeling of D}. What's more, this bijection pre- 
serves the required statistic pr(7r) in a recordable way: if D G PSDk+i with component polygons 
Pi, ... , Pg, and if L is a valid labeling of D, then the number of 2^-blocks in the unique vr whose 
compression is (P>, L) is equal to X^PeD I = ^^1 (here |P| denotes the number of sides of 

P). Hence, we can refer to the profile pr(P>, L). Note, in particular, that for this vr, 

e{w{^)) = L{P)\P\. 

We will refer to this sum simply as e{D, L). 

We can thence re-index the summation of Equation 4.2 as follows. 

m-2.-2(^A) = r^^'^^^) Y: E o.iar^''''''>r^(^'^\ (4.3) 

D&PSDk+i L labels D 

The sum in Equation 4.3 is quite complicated, but fortunately we are only looking for the leading 
order term in ^2^1 = j^-^. To achieve this, it is convenient to break up the sum into two parts: 
over those D with polygons having no more than 4 sides, and the remaining D that contain a 
polygon with at least 6 sides. Denote these two sets as PSD-^j^^ and PSD^j^^. 

For the sum over PSD^^^, we break up the sum according to the profile: look at those D con- 
taining s 2-gons Pi,. . . ,Ps and t 4-gons Qi , . . . , . In this case, from our previous discussion, any 
valid labeling gives label 1 to each of Qi, • • • , Qt (the label must be < 1, and since we suppose each 
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is present, the labels must be 1). On the other hand, valid labels for Pi, . . . ,Pg range independently 
among the positive integers (again, is excluded since we suppose all s are present). For such a 
labeling L, we have 

e{D, L) = 2{L{Pi) + ■■■ + L{Ps)) + 4(L(Qi) + • • • + L{Qt)) = 2(L(Pi) + • • • + L(P,)) + At. (4.4) 

For the cumulant term, the profile pr(Z:», L) has L{Pi) H h L{Ps) 2-blocks, and L{Qi) H h 

L{Qt) =t 4-blocks, and so 

However, ai(a) = K2[a, a*] = ||a||| = 1 by our normalization, and so so the cumulant term simply 
becomes 

So, letting r/i, . . . , r/^ denote the labels L{Pi), . . . , L{Ps) G N — {0}, we can express the sum over 

PSD^i, as 

^2(fc+l) ^ ^ ^(^)pr(D,L)^.(D,L) 

D&PSD^l^ L labels D 

(4.6) 

= ,2(.+i)^n,+i(.,t) X] a2(a)V2(m+-+..)+4i^ 

s,t=0 r)i,...,r]s=l 

where nfc+i(s,t) = #{D G PSD^^^ ; Dhass 2-gons & t 4-gons}. The internal sum (over rji, . . . ,r]s) 
factors as a product of s independent summations. 



E 

?7i,...,r?s=l 




^2(7?i+-+r,,)+4t ^ y^^2j? ^ ^4t 



So, summing over PSD^^^ yields 

s,t=0 \ / 

Of course, the indices s, t really have finite ranges: the 2-gons are chosen from among {^~^^) and 
the 4-gons from ('^4^) possible configurations, meaning that the constant nfc+i(s, t) is for large 
enough s, t. Since we seek the highest-order term in jrz^, we are interested in the largest s for 
which nfc_|_i(s, t) / for any fixed t: we expand the binomial to the power s in Equation 4.7, and 

are interested only in the term ( ) = (x^-i)s of highest order. 



J (A^-l)'* 

The key observation here is that, for any diagram D with only 2- and 4-gons, additional 2-gons 
may be added without crossings until the skeleton of 2-gons partitions the area of V^+i into 4- 
gons - i.e. it produces a tiling of Vk+i by 4-gons. There may be many distinct 4-gon tilings that can 
result from such a completion. Nevertheless, this means that, to enumerate those D with s 2-gons 
and t 4-gons, we may begin by considering any possible 4-gon tiling of V^+i, and then consider 
all possible ways of including s lines and t 4-gons in it. This construction is purely combinatorial 
and the details are left to the interested reader. The procedure is exemplified in Figure 11. 

The maximal s, for given t, for which nfc_|_i(s, t) / 0, is therefore given by the number of line- 
segments in a 4-gon tiling of V^+i. The following classical results may be found in [16]. 

Lemma 4.4, The number of line-segments in any A-gon tiling of a {2k + 2)-gon is 3fc + 1 (including the 
boundary edges). The number of such distinct tilings is given by the Fuss-Catalan number 
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(1,1) (1,1) 




(4,1) (4,1) 

Figure 11. A diagram in PSD^'^ with 7 2-gons (shown as straight lines) and 2 
4-gons. An additional 9 2-gons can be added to produce a tiling of the 12-gon by 
4-gons. 



Remark 4.5. It is well-known that the Catalan number Cj. = C^,^^ counts the number of 3-gon 
(triangular) tilings of a (/c + 2)-gon. This is the p = 1 case of the following theorem, proved in [16]; 

The number of {p + 2)-gon tilings of a (pk + 2)-gon is the Fuss-Catalan number cj^^K The second 
statement of Lemma 4.4 is the p = 2 case of this theorem. 

Thus, the largest s for which Ilk+i{s, t) / is s = 3fc + 1, provided t is not so large that there 
cannot be t 4-gons inserted into the tiling provided by the 3k + 1 2-gons. It is an easy matter to 
count that there are k distinct 4-gons in any 4-gon tiling of Vk+i, and so we have 11^4.1(5, t) > 
whenever < s < S/c + 1 and < t < k. For fixed t in this range, there are (^) distinct choices of 
positions for the t 4-gons out of the k slots. Hence, we have proved the following: 

Uk+i{3k + l,t)= (^^ Cf\ 0<k<t. (4.8) 

And, of course, 11^+1(5, t) = for t > k. Combining Equations 4.7 and 4.8, we have that the 
leading-order coefficient in is contained in the expansion of 

3fc+l 



r2(^+i)J^n,+i(3A: + l,t) {a2{a)r^)' 

(i^-O^'eC) (May) 

^ ^ t=0 ^ ^ 



The sum over t simplifies, via the binomial theorem, to (1 + a2{a)r'^^^ . On the other hand, if we 

/ N 3fc + l 

expand the binomial to the power 3/c + 1, and reserve only the highest order term ( -yz^ ] 1 we 



find that the leading order contribution to the PSD^^^ sum is given by 

+ (l_,2)3.+i - (4-9) 

At this point, it is useful to manipulate the expression to bring it to a more familiar form. If we 
return to the variable A = 1/r, Equation 4.9 becomes 



\-2(fc+l) 
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which equals 



. , , \-2k-2 \62+2 

^(2) /^4 ^ 

(A +Q2(a)) A j^^—^^^. 



ci^' ' \.. • (4.10) 



Simplifying the last expression, and substituting the value a2{a) = ||a||4 — 2 = 7;(a) — 1, we have 
finally that the leading order term in the PSD^^^ expansion is 

,(2)(A^-1+^)^- 
(A2 - l)3fc+l 

(Equation 4.10 should be compared, favourably, with Equation 3.26.) It is worth noting that, since 
A^ — 1 is divisible by A^ — 1, we could again expand Equation 4.10 as a binomial, and the leading 

/ \ 3fe+l 7. 

order term in the highest power ( j is simply C^, 'v{af. This is the desired entire lead- 
ing order term according to Theorem 3.11, and so we must now argue away all the terms in the 
PSD^j^-^ expansion. That is, summing up and returning to Equation 4.3, we have 

(2) (A^-l + t;(a))^ 

rn-2fe-2(/^A) = CI (>^2_i)3fc+l 

DgP5D|?_|_i L labels D 

+ lower order terms in —r^ — . 

A^ — 1 

We now must proceed to show that the middle terms in Equation 4.11 are all sub-leading order. 

Remark 4.6. Consider the special case a = c is circular. Here ai{a) = for i > 1, and so the only 
terms in Equation 4.3 that contribute come from diagrams with only 2-gons. Referring to Equation 
4.7, in this case we have the exact formula 

3fc+l ^ 

(2) 

The leading coefficient (with s = 3A; + 1) is ; the lower-order coefficients are very challenging 
to calculate. Nevertheless, the fact remains that the moment is a polynomial in jrzj' which is a 
point of independent interest. 

Now let us consider the PSD^^^ terms in Equation 4.11. In fact, we can give a general expansion 
like the one given above for any allowed profile of PSD D. Consider those D with si 2-gons, S2 
4-gons, and so on through Sfc_|_i 2{k + l)-gons. (The condition that D is in PSD^_^^ means there 
is some ^ > 3 with Si > 0.) Denote the 2£-gons present as Pf,...,Pg^. Valid labeling of such 
polygons again takes the following form: each of through Pg_^ can be labeled with any positive 
integer, while those P^ with i > I must have label 1. The general expansion is then 



«lv,Sfc+i rii,...,risi=l 

where 11^+1 (si, . . . ,Sk+i) is the number of D G PSD^j^i with si 2-gons, S2 4-gons, and so on 
through Sfc+i 2{k + l)-gons, and in this general setting we have 

a(a)Pr = a2{ay^ ■ ■ ■ afc+i(a)"'=+i , e = 2{r]i + ■ ■ ■ + r]s,) + 4s2 + ■ ■ ■ + 2{k + l)sk+i. 

(The expression for a(a)P'" should also contain the term ai(a)''^"' '"^^i, but as above the normal- 
ization ||a||2 = 1 sets this term equal to 1.) Again the internal sum simplifies to a product, and we 
have 

m.2k-2{f^x) = r"^'^'^ E nk+iisi,...,s,^,)r^^^+^^^+---^'('^^'>^^^ (4.12) 
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(This expression is completely general and we could have started with it instead of considering 
the case that se = for £ > 2 as we did. Also, in the preceding notation, the terms Ilk+i{s,t) 
would now be denoted 11^+1(5, t, 0, . . . , 0).) The range of each sj is through a finite set, and so 
again we see it is only the 2-gons that yield an infinite expansion - all other terms are polynomial 
in = 1/A^. Now, consider the portion of this sum corresponding to PSDf,j^^: those terms for 
which at least one S£ with £ > 2 is non-zero. The following lemma shows that the leading order in 
■j^rzi cannot be achieved in this case. 

Lemma 4.7. Let 2 < £ < k + 1, and suppose that S£ > 0. Then 11^+1(5, S2, ■ ■ ■ , Sfc+i) = Ofor s > 3A; + 1. 

/ -, \3fc+l 

In other words, there are no ( ) or higher-order terms in the PSD^^-^^ expansion. The idea 

behind the proof is quite easy: the maximal number of non-crossing 2-gons in V^+i is 3A; + 1, which 
is achieved by any 4-gon tiling. According to Lemma 4.4, any 2^-gon Pj in D with £ > 2 can be 

(2) 

subdivided into £ — 1 4-gons (in C^„^ distinct ways) by adding £ — 2 lines. The modified D can 
have no more than the maximal number, 3A; + 1, line segments, which means that the original D 
can have no more than {3k + I) — {£ — 2) 2-gons; if £ > 3, this means the maximal order is not 
achieved. This is demonstrated in Figure 12. 



(1.1) 



(6,1) 



(5,.) 



(5,1) 




(2,1) 



(2,*) 



(3,1) 



(4,1) 

Figure 12. A PSD D with a 6-gon. This 6-gon can be subdivided into two 4- 
gons (demonstrated by the dotted line), showing that D cannot have the maximal 
possible number (16) of 2-gons. 

Thus, all of the PSDf.^-^^ terms in Equation 4.11 contribute to SM&-leading order, and referring to 
Equation 4.11, we have thus proved that 



m_2fc-2(/"A) = C 



(2)(A^-l + ^(«))^ 



+ lower order terms in 



(4.13) 



As explained above, we may safely ignore the — 1 term inside the numerator as it cancels 
to yield lower-order contributions. As such. Equation 4.13 provides our combinatorial proof of 
Theorem 3.11. In fact. Equation 4.13 is a finer statement, whose character warrants further study. 
On the other hand, the combinatorial approach shows that all the expressions involved are in fact 
polynomials. We record this as the following theorem, which is evident from Equation 4.12 noting 



that 



1 



1 



Theorem 4.8. Let a be i% -diagonal in a lli-f actor with trace ip, normalized so that \\a\\2 
Then there is a polynomial P^j^i in two variables so that 

^[(A-a)(A-a)1-('=+^) = P^+/ ^ ^ 



1, and let k >0. 



A2 - 1 ' A2 



for A > 1. 
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Remark 4.9. It should be noted that, following Equation 4.12 and referring to Equation 4.9, the 
polynomial P^_^i{x, y) can be taken so that its leading term in x is 

This result looks quite different from the expression we would expect from Equation 4.13. The 
point here is that the two variables x, y of P^^i are not truly independent, since the instantiations 
X = jTzi and y = ^ lead to the relation x — y = xy. Using this, it is possible to transform the above 
expression into many other forms, and so it is impossible to speak of the polynomial in Theorem 
4.8. 

4.2. Theorem 1.1 via negative moments. The asymptotic upper-bound of Theorem 1.1, with a 
non-sharp constant, is actually non-asymptotic. This is an easy application of the strong Haagerup 
inequality in [10], whose one-dimensional case (which is needed here) was really proved in [12]. 
In short: with a .^-diagonal normalized so that ||a||2 = 1, the spectral radius of a is < 1 and so for 
A > 1 we may write (A — a) = A(l — ra) where r = l/A<lasin Section 4.1. Then we may expand 

oo 

(A - a)-^ = r(l - ra)-^ = 

n=0 

Hence ||(A — a)^^\\ < rY^j^r"'\\a"'\\. By Corollary 3.2 in [12], ||a"|| < Y^i/n||a|| since ||a||2 = 1, 
and so we have (A — a)^^ < Y^r||a|| J2n v^''" = \/e A~-'^||a|| Yin y/ne~^'^^^. It is well-known 
that the series Yn '^^e*" is bounded above and below by constant multiples of fP"^ for t, p > 0. 
Expanding In A as a power series in (A — 1), we then have 

||(A-a)-i|| < V^||a||A-i(lnA)-3/2 ^ ||a||(A- l)-3/2^ ^ | 1, (4_14) 

Remark 4.10. In addition to a non-sharp constant, the estimate of Equation 4.14 scales with ||a|| 
rather than the correct quantity \/v{a) from Theorem 1.1. Since both v{a) and 1 1 a 1 1 can be expressed 
in terms of aa*, which (for ^-diagonal a) can have arbitrary compactly-supported distribution on 
[0, oo), it is easy to see that ||a|| can be arbitrarily large compared to ^v{a). Hence the analytic 
argument of Section 3 is necessary to get these sharp results. 

Remark 4.11. In fact, it is true that \/v{a) < \\a\\ under the normalization ||a||2 = 1; this is not 
obvious since the two sides scale differently. Note that f (a) = HaHl — 1 = || aa* Hi — 1- Let v be the 
distribution of aa*; then the supremum of suppz^ is \\aa*\\ = ||a|p. The condition ||a||2 = 1 means 

II II 2 

that the mean of aa* is 1, and so Jq tfi{dt) = 1. But this means that the measure u{dt) = tfi{dt) 
is also a probability measure on [0, ||a|p], and thus we have 

||a||| = ||aa*||^ = / r fi{dt) = / tu{dt)<\\af. 
Jo Jo 

This shows that the upper-bound of Equation 1.1 is an improvement over the one derived from 
the strong Haagerup inequality. 

In fact, the loioer bound of Equation 1.1 in its sharp form can be proved easily from Theorem 3.11, 
which can itself be seen as a combinatorial result a la Section 4.1. The idea is to use the following 
simple estimate. Let x, y be bounded commuting positive semi-definite operators. Then x < \\x\\ 
in operator sense, and so xy < ||x||y. Applying this with y = x^ for some positive integer k, we 
have x''^^ < ||x||x'''; continuing inductively this yields x'^'^^ < HxH'^'x, and so applying a state ip to 
both sides, (/?(x'^+-^) < ||x||*'V(^)- 

Now, apply this to x = i2a(A)*i?a(A), so that 

||x|| = \\Ra{X)f, 

ifix) = ip{Ra{X)Ra{XT) = ||(A - a)-i||2 = m_2(^A), 
V9(x'=+^) = ^ (((A - a)(A - a)*)^('=+i)) = m^2k^2{f^x). 
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Thus, we have 

\\{X-a)-'r> "'~''~'^^'\ (4.15) 
Applying Equation 3.27, this ratio is asymptotic to the following: 

m^2if^x) C7(2)^;(a)0(A2 - l)"i 

Taking 2/eth roots, we see from Equation 4.15 that as A | 1, || (A — a)~^ || is bounded below by 

(C^f VHa){X' - I)-"' - (Cf )^/'' 2-3/2 v^(A _ i)-3/2. 
Stirling's formula shows that 



k~>oo V / 2 

and this gives precisely the sharp constant \ in Theorem 1.1. 
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